Regret-Optimal Estimation and Control
نویسندگان
چکیده
We consider estimation and control in linear dynamical systems from the perspective of regret minimization. Unlike most prior work this area, we focus on problem designing causal state estimators controllers which compete against a clairvoyant noncausal policy, instead best policy selected hindsight some fixed parametric class. show that regret-optimal filters can be derived state-space form using operator-theoretic techniques robust control. Our results viewed as extending traditional control, focuses minimizing worst-case cost, to regret. propose analogs Model-Predictive Control (MPC) Extended Kalman Filter (EKF) for with nonlinear dynamics present numerical experiments these algorithms significantly outperform standard approaches
منابع مشابه
Optimal Control With Disturbance Estimation
The paper deals with a very common situation in many control systems and this is the fact that, for zero control action, the controlled variable is nonzero. This is often caused by the existence of another process input which is uncontrolled. Classic controllers do not take into account the second input, so deviation variables are considered or some feedforward controller is used to compensate ...
متن کاملOnline PCA with Optimal Regret
We investigate the online version of Principle Component Analysis (PCA), where in each trial t the learning algorithm chooses a k-dimensional subspace, and upon receiving the next instance vector xt, suffers the “compression loss”, which is the squared Euclidean distance between this instance and its projection into the chosen subspace. When viewed in the right parameterization, this compressio...
متن کاملOptimal Neural Codes for Control and Estimation
Agents acting in the natural world aim at selecting appropriate actions based on noisy and partial sensory observations. Many behaviors leading to decision making and action selection in a closed loop setting are naturally phrased within a control theoretic framework. Within the framework of optimal Control Theory, one is usually given a cost function which is minimized by selecting a control l...
متن کاملOptimal Population Codes for Control and Estimation
Agents acting in the natural world aim at selecting appropriate actions based on noisy and partial sensory observations. Many behaviors leading to decision making and action selection in a closed loop setting are naturally phrased within a control theoretic framework. Within the framework of optimal Control Theory, one is usually given a cost function which is minimized by selecting a control l...
متن کاملNear-optimal Regret Bounds for Reinforcement Learning Near-optimal Regret Bounds for Reinforcement Learning
This technical report is an extended version of [1]. For undiscounted reinforcement learning in Markov decision processes (MDPs) we consider the total regret of a learning algorithm with respect to an optimal policy. In order to describe the transition structure of an MDP we propose a new parameter: An MDP has diameter D if for any pair of states s, s there is a policy which moves from s to s i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Automatic Control
سال: 2023
ISSN: ['0018-9286', '1558-2523', '2334-3303']
DOI: https://doi.org/10.1109/tac.2023.3253304